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Ch I We use hydrodynamic cosmological simulations to predict correlations be- 

1-^ I tween Lja forest absorption and the galaxy distribution at redshift z ^ 3. The 

I probability distribution function (PDF) of Lya flux decrements shifts systemat- 

I ically towards higher values in the vicinity of galaxies, reflecting the overdense 

^ I environments in which these galaxies reside. The predicted signal remains strong 

' in spectra smoothed over 50 — 200 km s~^, allowing tests with moderate reso- 

^ I lution quasar spectra. The strong bias of high redshift galaxies towards high 

0\ ' density regions imprints a clear signature on the flux PDF, but the predictions 

^ ' are not sensitive to galaxy baryon mass or star formation rate, and they are sim- 

O ■ ilar for galaxies and for dark matter halos. The dependence of the flux PDF on 

■ galaxy proximity is sensitive to redshift determination errors, with rms errors of 
I ' 150 — 300 km s~ substantially weakening the predicted trends. On larger scales, 

^ • the mean galaxy overdensity in a cube of 5 or 10 Mpc (comoving) is strongly 

^ ■ correlated with the mean Lya flux decrement on a line of sight through the cube 

• center. The slope of the correlation is ~ 3 times steeper for galaxies than for 

■ dark matter as a result of galaxy bias. The predicted large scale correlation is 
' in qualitative agreement with recently reported observational results. However, 



observations also show a drop in the average absorption in the immediate vicin- 
ity of galaxies, which our models do not predict even if we allow the galaxies or 
AGNs within them to be ionizing sources. This decreased absorption could be a 
signature of galaxy feedback on the surrounding intergalactic medium, perhaps 
via galactic winds. We find that a simplified "wind" model that eliminates neu- 
tral hydrogen in spheres around the galaxies can marginally explain the data. 
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However, because peculiar velocities allow gas at large distances to produce sat- 
urated absorption at the galaxy redshift, these winds (or any other feedback 
mechanism) must extend to comoving radii of ~ 1.5 h^^ Mpc to reproduce the 
observations. We also discuss the possibility that extended Lya emission from 
the target galaxies "fills in" the expected Lya forest absorption at small angular 
separations. 



1. Introduction 

The strong clustering of Lyman-break galaxies (LBGs) at ^ ~ 3, comparable to that of 
present-day, optically selected galaxies, suggests that they are highly biased tracers of the 
underlying dark matter distribution (Adelberger et al. 1998, 2002a). This bias appears to 
arise naturally in semi-analytic models and hydrodynamic numerical simulations (e.g., Baugh 
et al. 1998; Governato et al. 1998; Katz, Hernquist, & Weinberg 1999; Kauffman et al. 1999; 
Cen & Ostriker 2000; Benson et al. 2001; Pearce et al. 2001; Yoshikawa et al. 2001; Weinberg 
et al. 2002a), which predict that the luminous members of the high redshift galaxy population 
reside in massive halos, which in turn reside in regions of high background density (Kaiser 
1984; Mo & Fukugita 1996; Mo & White 1996). However, the bias of Lyman-break galaxies 
is inferred by comparing their observed clustering to the predicted clustering of dark matter, 
which depends on the assumed cosmological model. The Lja forest offers a tracer of structure 
whose relation to the underlying dark matter distribution appears to be well understood on 
theoretical grounds, probing the same redshifts as the LBG population. Correlations between 
LBGs and Lya forest absorption therefore offer a natural and potentially powerful probe of 
the relation between high redshift galaxies and the dark matter distribution, and perhaps 
for the influence of high redshift galaxies on the surrounding intergalactic medium (IGM). 

In this paper we use smoothed particle hydrodynamics (SPH) simulations to predict the 
correlations between z — 3 galaxies and Lya forest absorption. Hydrodynamic simulations 
are ideal for this purpose, since they simultaneously predict the locations and properties of 
the galaxies (e.g. Katz, Hernquist, & Weinberg 1999; Nagamine et al. 2001; Weinberg et al. 
2002b) and the structure and ionization state of the intergalactic gas that produces the Lya 
forest (e.g. Cen et al. 1994; Zhang, Anninos, &: Norman 1995; Hernquist et al. 1996; Theuns 
et al. 1998). However, the basic expectations can be understood in simple terms, using the 
"Fluctuating Gunn-Peterson Approximation" (Bi & Davidsen 1997; Croft et al. 1997; Ranch 
et al. 1997; Weinberg, Katz, & Hernquist 1998; Croft et al. 1998), which provides a fairly 
accurate description of the Lya forest results from full hydrodynamic calculations. In this 
approximation, the Lya flux decrement at wavelength A is related to the overdensity at a 
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redshift z — X/Xa,rest — 1 along the line of sight by 

L> = 1 - 1^ = 1 - e-- = 1 - exp [-A{p/pf] , (1) 

with 

^ = 694 (l±±X rM!V (^) (—V (^Mllh] " ( I V 

V 4.0 J \om J VeoooKy \o.65j \ 5.12 J x sec-^ J 

(2) 

Here h = ifo/100 km s^^ Mpc ^, ilb is the baryon density parameter, and F is the HI 
photoionization rate due to the diffuse UV background at redshift z. This approximation 
assumes that all gas obeys a temperature-density relation T — To{pij/ pi,)"', which emerges 
from the interplay between photoionization heating and adiabatic cooling caused by cosmic 
expansion. For typical reionization histories, one expects Tq ~ 5000 — 20, 000 K and p = 
2 — 0.7a ~ 1.6 — 1.8 at ;i; ~ 3, with the higher Tq and f3 values arising if helium reionization 
occurs close to this redshift (Hui & Gnedin 1997). Strictly speaking, the overdensity in 
equation (1) is the gas overdensity, but in the moderate density regions that contribute most 
of the Lya forest opacity, the gas pressure is low, and the gas and dark matter trace each 
other fairly well. This approximation ignores the effects of coUisional ionization and peculiar 
velocities. 

From equation (1), one can sec that the probability distribution function (PDF) of 
Lya forest flux decrements, P{D), is closely related to the PDF of the underlying density 
field, P{p/p), with an effective smoothing scale determined by a combination of pressure 
support and thermal broadening along the hue of sight. The flux decrement PDF can 
therefore provide a diagnostic for the amplitude of mass fluctuations and for departures from 
Gaussianity in primordial fluctuations (Weinberg et al. 1999a; Groft et al. 1999; McDonald 
et al. 2000). 

The statistical measure that we focus on in this paper is the conditional flux decrement 
PDF, the probabihty P{D) that a pixel in a Lya forest spectrum has a flux decrement in the 
range D — > D + dD, conditioned on the presence of a galaxy at the same redshift close to the 
line of sight. (Since D = 1 — F/Fc, we will use the slightly less cumbersome term "conditional 
flux PDF," but we treat D as the observable of interest because it is an increasing function 
of density.) This statistic can be measured by obtaining Lya forest spectra in flelds covered 
by LEG surveys. The bias of galaxies towards high density regions should reveal itself as 
a systematic shift of P{D) towards higher flux decrements in pixels close to galaxies, with 
closer proximity yielding stronger shifts. Gonversely, feedback of galaxies on the local IGM 
via ionization or galactic winds might shift the PDF towards lower flux decrements at small 
separations. 
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A complete measurement of the conditional flux PDF will require ambitious, time- 
consuming surveys on large telescopes, since each LBG-quasar pair contributes only a single 
point to the distribution. However, characteristics of the conditional PDF like the mean 
decrement or the fraction of saturated and "transparent" pixels can be measured with fewer 
data points, and we present predictions for these quantities in addition to the full PDF. We 
also show that the signature of galaxy proximity should persist in spectra smoothed over 
50 — 200 km s~^, allowing theoretical predictions to be tested using relatively low resolution 
quasar spectra. With higher resolution spectra the dependence of the conditional PDF on 
spectral smoothing offers a further test of the models. 

K. Adclberger and collaborators have been carrying out a survey along the lines en- 
visioned here (Adelberger et al. 2002a, hereafter ASSP; Adelberger et al. 2002b, hereafter 
ASPS). While we have not designed our study primarily for comparison with their results, 
we will discuss the comparison between our predictions and some of their measurements in 
§5 below. Inspired by their work, we also consider an additional statistic, the mean galaxy 
overdensity in 5 and 10 Mpc (comoving) cubes as a function of the mean decrement 
on a line of sight through the middle of the cube. This statistic characterizes the large 
scale correlations between galaxies and IGM overdensity, while the conditional flux PDF 
characterizes correlations on smaller scales. McDonald, Miralda-Escude, & Cen (2002) have 
presented predictions for this statistic using dark matter in N-body simulations, and here 
we show predictions for the biased galaxy population of a hydrodynamic simulation. 

There is substantial overlap between our investigation and the recent paper of Croft et 
al. (2002a). As discussed in §5, our results generally agree well with theirs where we examine 
similar quantities. Croft et al. (2002a) devote considerably more attention to models that 
incorporate galactic winds, which they add to their simulations using the post-processing 
approach of Aguirre et al. (2001). Here we concentrate on the direct predictions of our 
simulations, which incorporate thermal feedback in the local ISM that is usually radiated 
away before it can drive a galactic wind. However, we do estimate the possible effects of 
galaxy photoionization, and of photoionization by recurrent AGN activity associated with 
galaxies, and we briefly discuss the possible influence of stronger winds. We describe our 
simulations, galaxy identification, and spectral extraction in the next section. We present our 
results for the conditional flux PDF in §3 and for the correlation between mean decrement 
and galaxy counts in cubic cells in §4. In §5, we briefly review our results, then discuss them 
in relation to the theoretical study of Croft et al. (2002a) and the observational analyses of 
ASSP, concluding with some remarks about future directions. 
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2. Method 

Our results are derived from SPH simulations of an inflationary cold dark matter model 
with a cosmological constant (ACDM), with cosmological parameters fi„i = 0.4, Q\ = 0.6, 
h = 0.65, fib = 0.02/i~^ = 0.0473, an inflationary spectral index n = 0.95, and a power 
spectrum normalization as — 0.80. These simulations use a parallel implementation of 
TreeSPH (Hernquist & Katz 1989; Katz, Weinberg & Hernquist 1996, hereafter KWH; Dave 
et al. 1997), incorporating radiative cooling and star formation as described by KWH. We 
rely mainly on two simulations, one that uses 128^ SPH particles and 128^ dark matter 
particles in a periodic cube 22.222 Mpc (comoving) on a side, and one that uses 2 x 144^ 
particles in a cube 50 h-^ Mpc on a side. We use two additional simulations to investigate 
numerical resolution effects. We refer to simulations with the nomenclature LXnY, where the 
box size is X/i~^ Mpc and the particle number is for each species. The main simulations 
are thus L22nl28 and L50nl44, and the additional simulations are Llln64 and Lllnl28. 
The properties of all of these simulations are summarized in Table 1. L22nl28 and Llln64 
have the same resolution as simulations that we have previously used to study the Lyct forest 
(Hernquist et al. 1996; Dave et al. 1999), while Lllnl28 has a mass resolution a factor of 
eight higher. The L50nl44 simulation was designed primarily for studies of galaxy assembly, 
galaxy clustering, and the low redshift intergalactic medium (Dave et al. 2001, 2002; Croft 
et al. 2001; Chen et al. 2002; Fardal et al. 2002; Murali et al. 2002; Weinberg et al. 2002a). 
It has a substantially larger volume than our other simulations, but its mass resolution is 
a factor of eight lower than that of L22nl28. We analyze all of the simulations at redshift 
z — 3, close to the mean redshift of the main LEG samples. 

With the exception of L50nl44, all of the simulations were evolved with the photoion- 
izing UV background calculated by Haardt & Madau (1996). At the processing stage, the 
intensity of the background is adjusted by small factors (0.75 — 1.16, depending on the simu- 
lation) so that the mean decrement of the Lya forest spectra extracted from the simulation 
matches the value D — 0.36 found by Press, Rybicki, & Schneider (1993) at z — 3 (Bernardi 
et al. 2003 find a nearly identical value with a much larger quasar sample). The L50nl44 
simulation was run with no ionizing background (for the reasons discussed by Weinberg, 
Hernquist & Katz 1997), so its unshocked gas cools adiabatically to unrealistically low tem- 
peratures. We therefore modify the temperatures of all gas particles that have (p/p) < 100 
and T < 10^(p/p)°'^ K, setting them to T = 10^(p/p)°'^ K. We do not modify the tem- 
peratures of hotter particles. In this way, we mimic the results expected if we had used an 
ionizing background during evolution, and the tests in §3.1 below indicate that this approach 
is adequate for our purposes. We again extract spectra with an ionizing background intensity 
chosen to reproduce the Press, Rybicki, & Schneider (1993) mean fiux decrement. 
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The star formation algorithm is discussed in detail by KWH. In brief, cold gas with 
physical density uh > 0.1 cm~^ and overdensity p/p > 1000 is converted into stars on 
a timescale determined by the local dynamical and cooling times. Feedback is deposited 
as thermal energy in the surrounding medium, and it is usually radiated away before the 
interstellar medium becomes hot enough to drive a "galactic wind." The SKID algorithm^ 
(Spline-Kernel Interpolated DENMAX; see KWH and Gelb & Bertschinger 1994) is used 
to identify clumps of stars and cold gas (T < 30, 000 K, p/p > 1000) that correspond 
to "galaxies." Because these systems are extremely overdense, there is no ambiguity in 
their identification, and their masses are robust to changes in the details of the radiative 
coohng or star formation treatment (KWH; Weinberg, Hernquist & Katz 1997). We restrict 
our attention to galaxies with baryonic mass (stars plus cold gas) Mh > 64mspH, since 
our comparisons among simulations of different resolution indicate that this population is 
insensitive to resolution effects. 

The simulations produce a population of high redshift galaxies whose clustering, star 
formation properties, and sub-mm luminosities are in approximate agreement with obser- 
vations (Katz, Hernquist, & Weinberg 1999; Weinberg et al. 1999b, 2002a,b; Fardal et al. 
2002), though the uncertainties in some of these comparisons are substantial. Correlations 
with the Lya forest offer the possibility of testing the simulation predictions for the relation 
between these galaxies and the underlying mass distribution. The simulated galaxies span 
a wide dynamic range in baryonic mass and star formation rate (SFR), and we want to 
know whether the predicted correlations with the Lya forest depend substantially on galaxy 
properties. Rather than model the galaxy spectral energy distributions in detail, we define 
galaxy samples above thresholds in baryonic mass or SFR and characterize them by their 
space density, 

y = ^g^' (Q) 

ei^ X (A.)box ' ^ ^ 

where A'gai is the number of galaxies above the threshold in the simulation box, ^box is 
the projected size of the box in arc-minutes, and (A2;)box is the redshift depth of the box. 
The units of E are thus number per square arc-minute per unit redshift (which we will 
write arcmin"^ A^;"^), and it is worth emphasizing that "per unit redshift" makes E a space 
density rather than a surface density. To the extent that galaxies' rest-frame UV luminosities 
are monotonically related to their instantaneous star formation rates, samples selected above 
an SFR threshold should correspond to observed LBG samples above an optical magnitude 
limit. Typical LBG spectroscopic samples to R ^ 25.5 have S ~ 1 — 3arcmin~^ Az~^ (Steidel 
et al. 1996; Adelberger et al. 1998). 



^http : //www-hpcc . astro .Washington. edu/tools/skid.html 
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Lya forest spectra are extracted using TIPSY^, as described by Hernquist et al. (1996). 
Briefly, the optical depth at each position is calculated from the neutral hydrogen mass, neu- 
tral mass-weighted velocity, and neutral mass-weighted temperature, derived by integrating 
through all of the SPH smoothing kernels that overlap that spatial position. We extract 
1200 spectra along random lines of sight (400 in each projection) for each simulation. We 
use "pixels" of 1.5 — 3 km s~^, small enough that variations on the thermal broadening scale 
~ 10 km s~^ are fully resolved. As discussed below, we compute statistics after smoothing 
these spectra with Gaussians of la width 10 — 200 km s~^, with the smallest of these scales 
representing the approximate spectral resolution of typical high resolution QSO spectra. We 
do not add noise to our artificial spectra; in general, noise will act to broaden measured 
distribution functions but will average out in measurements of mean decrements. To com- 
pute the conditional flux PDF statistics discussed in the next section, we search around each 
sightline for nearby galaxies in the cylindrical volume deflned by a given search radius and 
the depth of the box. This method yields few matches at small angular separations or for 
rare galaxy classes, so we supplement the random spectra with spectra extracted close to the 
known positions of the galaxies, with the distribution of projected separations that would 
be expected for random positioning. 



3. The Conditional Flux PDF 

We begin by examining the physical conditions of the IGM in the vicinity of galaxies. 
Figure 1, based on the L22nl28 simulation, shows the median value, interquartile range, and 
10-90% range of the temperature, gas overdensity, neutral hydrogen density, and neutral 
hydrogen fraction in pixels that lie at the same redshift as a simulated galaxy at transverse 
separation (in comoving Mpc). For our adopted cosmological parameters, an angular 
separation of 1' at z = 3 corresponds to rt = 1.19 Mpc (comoving), and to Hrt — 
152 km s~^ As expected, the gas density increases close to galaxies. Typical temperatures 
are also higher in the vicinity of galaxies. In the median, this temperature increase is 
mostly accounted for by the increased photoionization heating at higher densities, but the 
spread to high temperatures is probably an effect of shock heating as gas falls into collapsed 
regions. In our simulations, supernova heating should have little impact at the separations 
shown in Figure 1. While higher temperatures counteract the influence of higher densities, 
the density effect is stronger, and typical neutral hydrogen densities increase sharply in 
the neighborhood of galaxies. The median neutral fraction is roughly constant at ~ 10~^, 
increasing at rt < 100 kpc. 



^http : //www-hpcc . astro . Washington . edu/tools/tipsy/t ipsy . html 
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Figure 1 motivates our use of the statistic described in the introduction, the conditional 
PDF of the flux decrement D in pixels within angular separation ^^ax of a galaxy at the 
same redshift. We will investigate the dependence of this statistic on the angular separation 
threshold ^^maxj on the smoothing of the Lya forest spectrum, on the velocity separation 
between the pixel redshift and the galaxy redshift, and on the properties of the galaxies 
themselves. In addition to the complete conditional PDF, we calculate three quantities that 
summarize some of its properties: the (conditional) mean decrement {D) = £ DP{D)dD, 
the fraction /sat of "saturated" pixels with D > 0.9, and the fraction /trans of "transparent" 
pixels with D < 0.55. Before presenting our main results from the L22nl28 and L50nl44 
simulations, we first test the sensitivity of the predictions to numerical parameters of the 
simulations. 

In general, one can consider the dependence of the flux PDF on both the angular separa- 
tion between the galaxy and the QSO sighthne and the velocity separation AV between 
the galaxy redshift and the pixel redshift. For the most part, we concentrate on the AV — 
slice of this more general distribution P[D\A6, AV), so for each galaxy-sightline pair we con- 
sider the single pixel closest to the galaxy redshift, incorporating the effects of peculiar ve- 
locity on both the galaxy redshift and the Lja forest spectrum. Alternatively, one can define 

1 /2 

the conditional flux PDF in bins of redshift-space separation A,. = [(d^A^)^ -|- (AV/Hy] , 
where dA is the angular diameter distance and H the Hubble parameter at redshift z. There 
are three disadvantages to the latter approach: it requires a cosmological model to compute 
dA and H{z) (which do not scale the same way with cosmological parameters), peculiar 
velocities distort the relation between AV and distance, and the statistical properties of the 
measured distributions are more complex because each galaxy-quasar pair contributes cor- 
related values to P{D) in different bins of A,,. However, the A,, deflnition has a signiflcant 
advantage when computing the conditional mean decrement from a small sample of moder- 
ate S/N spectra, namely that each spectrum contributes many pixels and the noise averages 
out. Our definition requires either high S/N per pixel or a large number of galaxy-QSO 
pairs to obtain a precise measurement of the conditional mean decrement. In practice, we 
recommend smoothing spectra enough to ensure high S/N per pixel, since we show below 
that smoothing over AV ~ 50 — 100 km s~^ does not reduce the discriminating power of the 
measurements. This way explicit smoothing achieves the noise suppression that is obtained 
by binning in the A^ approach. In §5, wc will calculate the mean decrement in bins of A^ 
for comparison to Croft et al.'s (2002a) predictions and ASSP's measurements. 
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3.1. Dependence on Numerical Parameters 

We investigate the influence of mass resolution using the Lllnl28 and Llln64 simula- 
tions, which have the same initial conditions and box size but a factor of eight difference 
in particle mass. Figure 2 presents the conditional flux PDFs of these simulations, in a 
format similar to that used in many of our subsequent figures. The left hand panel shows 
results for spectra smoothed with a Gaussian of la width Rg — 10 km s~^, and the right 
hand panel for Rg — 50 km s~^. Solid curves show the unconditional PDF, computed from 
all pixels in all of the randomly positioned spectra. Dotted curves show the conditional 
PDF for pixels within 2' of galaxies at the same redshift. The physical mass threshold for 
these galaxies is the same in the two simulations, corresponding to 64mspH in Llln64 and to 
512mspH in Lllnl28. Dashed curves show the conditional flux PDF for angular separations 

< 0.5'. Symbols along the top show the mean decrements computed from these three 
PDFs. Symbols along the right and left hand vertical axes denote the fractions of saturated 
and transparent pixels, respectively. 

We will discuss the physics behind these conditional PDFs in §3.2 below, though the ba- 
sic behavior is clearly what one would expect based upon the general discussion in §1, given 
the trends in Figure 1. The crucial point of Figure 2 is the similarity between results rep- 
resented by the bold lines/symbols and those represented by the light lines/symbols, which 
come from the Lllnl28 and Llln64 simulations, respectively. While there are systematic 
differences, most notably in the number of transparent pixels for < 0.5', these differences 
are quite small, especially compared to the much stronger dependence on the separation 
itself. Figure 2 implies that the L22nl28 simulation, which has the same mass resolution as 
Llln64 and a larger volume, should yield reliable predictions for galaxies above its 64mspH 
resolution threshold. 

Comparing the Llln64 and L22nl28 simulations allows us to test the influence of box 
size on the conditional flux PDF. The simulations cannot include density perturbations 
on scales larger than the fundamental mode of the box, and this absence of large scale 
fluctuations could alter the statistical properties of the Lya forest or the bias of galaxies 
with respect to the mass. We computed conditional flux PDFs for galaxy samples (selected 
on SFR) with a space density of E = 3 arcmin"^ Az~^ in the two simulations. We found 
only small differences in the results (and therefore do not plot them), with a slight shift of 
the distribution in the smaller box towards an increased number of low decrement pixels 
and a corresponding dearth of high decrement pixels. The conditional mean decrement is 
thus slightly depressed in the smaller box, but again this impact is much smaller than the 
impact of galaxy proximity itself. We conclude that the lack of large scale modes in the 
Llln64 simulation has only a minimal impact on the conditional flux PDF for galaxies of 
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this space density, and that the box size of the L22nl28 simulation should be adequate for 
our purposes. The bias of rarer, more massive (or higher SFR) galaxies might be more 
sensitive to the absence of long wavelength modes, but for rare galaxies one needs a large 
simulation volume to get adequate statistics in any case. 

Figure 3 compares the results of the L50nl44 and L22nl28 simulations, in the same 
format as Figure 2. Wc select galaxies above the same physical mass threshold, which corre- 
sponds to 64mspH and 512mspH, respectively. The space densities are S ^ 3 arcmin"^ A^;"^. 
In this comparison, differences could arise from random differences in the structure present, 
or from systematic effects of box size, mass resolution (which is another factor of eight lower 
in L50nl44), or our post-facto imposition of a temperature-density relation in L50nl44 (see 
§2). The good agreement in Figure 3 shows that the combined impact of the systematic 
effects must be reasonably small. In what follows, therefore, we will use L22nl28 to make 
predictions for abundant galaxy populations with E > 3arcmin' A^-^ and L50nl44 to 
make predictions for rarer populations, where its larger volume is needed to provide ade- 
quate statistics. The differences in Figure 3 are probably an upper limit on the level of 
systematic uncertainty associated with the numerical parameters of the simulations. Even 
the noticeable difference in the mean decrement for A^^ < 2' is not highly significant relative 
to the statistical uncertainties, since there are only 37 galaxies above the mass threshold 
in the L22nl28 simulation. We conclude that the trends predicted by our simulations are 
robust to the effects of finite resolution and box size and that our quantitative results are 
only mildly sensitive to these numerical effects. 



3.2. Predictions of the Conditional Flux PDF 

Figure 4 presents the conditional fiux PDF for the full population of resolved galaxies 
{Mb > 64mspH = 6.7 x lO^M©) in the L22nl28 simulation. There are 641 galaxies in the 
box at 2; = 3, and the space density of this population is E = 48 arcmin"^ A^;"^. (As 
indicated in the note to Table 1, the box is 18.74' on a side and has a redshift depth 
Az — 0.0379 = 2844km s~^.) The upper left panel is for spectra smoothed over 10km s~^, 
comparable to the spectral resolution of typical Keck HIRES or VLT UVES spectra. The 
unconditional fiux PDF, shown by the solid curve, refiects the probability distribution of 
gas overdensity P{p/p), which depends mainly on the amplitude of matter fiuctuations at 
z = 3 and on the assumption that the primordial fiuctuations were Gaussian (sec, e.g., the 
discussions by Miralda-Escude et al. 1996, 1997; Cen 1997; Ranch ct al. 1997; Weinberg et 
al. 1999a; Croft et al. 1999; McDonald et al. 2000). The shape of P{D) is, of course, strongly 
influenced by the non- linear transformation between density and flux decrement (eq. 1); for 
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example, the high and low density tails of P{p/ p) map to flux decrements D k,1 and D 
respectively, so P{D) peaks at these extremes. 

Dot-dashed, dotted, and dashed curves show P{D) for pixels that have a galaxy at 
the same redshift within ^max = 2', 1', and 0.5', respectively (comoving separations of 2.4, 
1.2, and 0.6 Mpc). The preferential location of galaxies in overdense regions reveals 
itself through shifts of the distributions towards higher flux decrements, with corresponding 
increases in the mean decrement (symbols along the top axis) and the fraction of saturated 
pixels (symbols along the right axis), and decreases in the fraction of transparent pixels 
(symbols along the left axis). These shifts become progressively stronger as the proximity 
to galaxies increases (6'inax decreases). Pixels that contribute to the /S.6 < 0.5' histogram 
also contribute to the < 2' histogram, but the number of galaxy-spectrum separations 
is proportional to (A^)^, so the conditional PDF for a given ^max is contributed largely by 
separations close to ^max- A sphere of radius 2.4 h"^ Mpc (A^ = 2') contains 3.4 galaxies on 
average, so for the values of ^max in Figure 4 the galaxy "matched" to any given pixel is not 
necessarily the one with the smallest 3-d physical separation. For the brighter, lower space 
density samples defined later, the matched galaxy is usually also the closest one. 

The remaining panels of Figure 4 show the unconditional and conditional flux PDFs 
for spectra smoothed over 50, 100, and 200km s~^. Smoothing brings the flux decrements 
in extreme pixels closer to the mean, so for larger Rg (which represents the la width of 
the Gaussian smoothing filter) the fractions of saturated and transparent pixels drop. With 
Rg = 200 km s^^ the unconditional PDF peaks close to the mean decrement, though it 
remains fairly broad. Roughly speaking, the fiux PDF of smoothed spectra refiects the 
PDF of density smoothed on a similar physical scale, but the smoothing of the spectra is 
1-d rather than 3-d, and smoothing does not commute with the non-linear transformation 
between density and flux. 

For our purposes, the important feature of these panels is that the difference between 
the unconditional and conditional flux PDFs, and between the conditional PDFs for different 
^max, persists for smoothing lengths up to 200 km s~^, and is perhaps even enhanced relative 
to the minimal smoothing case. Since we see a strong signature of galaxy proximity at 
A^ — 2' = 304 km s~^, it is no surprise that the signature persists when spectra arc smoothed 
over this physical scale; the difference between the unconditional and conditional PDFs 
reflects the large scale environments in which galaxies reside. Figure 4 shows that the 
simulation predictions can be tested with moderatc-to-low resolution spectra, thus allowing 
the use of relatively faint AGN as targets for observational studies. 

The principal difficulty with lower resolution spectra is the greater difficulty of contin- 
uum fitting, which causes systematic underestimation of flux decrements, especially at higher 
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redshifts where large expanses of unabsorbed continuum are rare. However, the quahtative 
trends with galaxy proximity should persist even in the presence of continuum fitting errors. 
With high resolution spectra, the dependence of the conditional flux PDF (or properties 
like (-D), /sat, and /trans) on smoothing scale can be used to test theoretical predictions, in 
addition to the dependence on galaxy proximity itself. Note that the mean of the uncondi- 
tional flux PDF does not depend on Rg, since smoothing preserves flux overall, but the mean 
decrement for conditional PDFs decreases with increasing Rg, since pixels close to galaxies 
tend to have high decrements that are reduced by smoothing. 

Another indicator of the physical scale of correlations is the dependence of the condi- 
tional PDF on the velocity separation between the pixel redshift and the galaxy redshift. 
In Figure 5, long-dashed lines and solid symbols repeat results for the case investigated in 
Figures 2-4, velocity separation AV — 0. Upper and lower panels show O^ax — 2' and 0.5', 
and left and right columns show smoothing lengths Rg — 10 and 50 km s~^. Dashed, dotted, 
and dot-dashed lines show the conditional PDFs computed from pixels that are offset by 
|AV"| = 100. 200, and 400 km s^^ from the galaxy redshift, where we have averaged the 
contributions at ±AV. For |Al^| = 100 or 200 km s^^, the signature of galaxy proximity on 
the flux PDF persists, but it is signiflcantly weakened. By \ AV\ = 400 km s^^, it has mostly 
vanished. The dependence on velocity separation provides another potential test of the sim- 
ulation predictions. Perhaps more significantly. Figure 5 shows that redshift measurement 
errors of ~ 100 — 400 km s~^ would significantly affect the shape of the conditional fiux PDF. 
We will return to this point in §3.6 below. 

The unconditional flux PDF is weighted by volume, since every volume element has 
an equal chance of becoming a pixel in a Lya forest spectrum. Therefore, even if galaxies 
traced the underlying mass distribution, we would expect the conditional fiux PDF to differ 
from the unconditional PDF because the mass itself is preferentially located in overdense 
regions. Figure 6 demonstrates, however, that the strong dependence of the fiux PDF on 
galaxy proximity is primarily a result of galaxy bias, not the difference between volume and 
mass weighting. Light curves, repeated from the corresponding panels of Figure 4, show the 
unconditional PDFs and the conditional PDFs for galaxy separations A^^ < 2' and A^^ < 0.5'. 
Heavy curves show conditional PDFs in which we use randomly selected dark matter particles 
in place of galaxies. While these curves are shifted relative to the unconditional PDFs shown 
by the sohd fines, they are not shifted nearly as much as the galaxy conditional PDFs. High- 
redshift galaxies form in special locations of the large scale matter distribution, and the 
signature of these special environments is imprinted on the nearby Lyct forest. 
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3.3. Dependence on Galetxy Properties 

In the local universe, galaxies with different stellar populations, star formation rates, 
morphologies, and masses tend to populate different environments, and the same may be true 
of high redshift galaxies. Theoretical models predict some dependence, albeit rather weak, 
of the LBG autocorrelation function on galaxy mass or SFR (e.g.. Mo & Fukugita 1996; 
Katz, Hernquist, & Weinberg 1999), and observations provide at least tentative evidence for 
stronger clustering of brighter LBGs (e.g., Adelberger et al. 1998; Giavalisco & Dickinson 
2001). The dependence of the conditional flux PDF on the properties of the conditioning 
galaxies is a potential diagnostic for the dependence of LBG properties on large scale envi- 
ronment. Here we will focus on instantaneous SFR for defining different galaxy subsamples, 
since it is the property most closely tied to rest-frame UV luminosity. In the simulated 
galaxy population, these SFRs are correlated with galaxy baryonic mass (Weinberg et al. 
2002b), so this procedure is approximately, but only approximately, equivalent to selecting 
on baryonic mass. 

In Figure 7, heavy solid lines show the conditional flux PDFs for the full galaxy popu- 
lation of the L22nl28 simulation, as in Figure 4, for < 2' (top) and A^^ < 0.5' (bottom), 
with spectral smoothing Rs = 10km (left) or 50km s"-*^ (right). Light solid hnes show 
the corresponding results for the full population of resolved galaxies (M^ > 64mspH = 
5.4 X lO^'^ Mq) in L50nl44. The space density of this more massive galaxy population is 
S = 3 arcmin"^ Az~^, with 450 galaxies in the simulation volume. Dotted lines show results 
for the 150 galaxies with the highest SFRs, and dashed lines show results for the top 50 
galaxies. The conditional flux PDF is remarkably insensitive to galaxy SFR over a range of 
nearly 150 in space density, though for A^^ < 0.5' the more massive, high SFR galaxies do 
show slightly higher mean decrement and saturated fraction and slightly lower transparent 
fraction, as expected if they reside in denser environments. 

One of the simplest and most widely used models for the spatial clustering of LBGs 
associates each LBG with a single dark matter halo (e.g.. Mo & Fukugita 1996; Adelberger 
et al. 1998; Moscardini et al. 1998; Bagla 1998). The large scale bias factor of these halos 
can be computed using the approximations of Mo & White (1996) and Sheth, Mo, & Tormen 
(2001). Katz, Hernquist, & Weinberg (1999) show that the correlation function oi z — 3 
galaxies in SPH simulations hke these is similar to that of dark matter halos of the same 
space density, even though the galaxy-halo association for a given space density is not truly 
1-to-l. Figure 8 presents an analogous comparison for the conditional flux PDF. Light lines 
and symbols show results for the 450 resolved galaxies in the L50nl44 simulation, while 
heavy lines and symbols show the results when these galaxies are replaced by the 450 most 
massive dark matter halos in the simulation (identified by a friends-of-friends algorithm with 
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linking parameter of 0.2). The conditional PDFs for galaxies and halos are very similar, with 
galaxies showing a slightly stronger preference for regions of high flux decrement, suggesting 
that they are more likely to occur in halos with higher density environments (where the 
halos themselves tend to be more massive). This similarity holds for other choices of angular 
separation, spectral smoothing scale, and population space density, not shown in Figure 8. 

A more general model for galaxy bias is the "halo occupation distribution" (HOD) 
formalism, which characterizes the mean number of galaxies of a given type in halos of 
virial mass M by an arbitrary function Ai'avg(M). This approach has been widely applied 
to studies of low redshift clustering (e.g., Jing, Mo, & Borncr 1998; Scljak 2000; Ma & Fry 
2000; Peacock & Smith 2000; Scoccimarro et al. 2001; Berlind & Weinberg 2002; Cooray 
2002; Cooray & Sheth 2002; Marinoni & Hudson 2002; Scranton 2002; Zehavi et al. 2003), 
and Bullock et al. (2001) and Moustakas & Somerville (2002) have analyzed LBG clustering 
in this context. If we neglect the finite size of the halos themselves, then the influence of 
different Nsyg{M) choices on the conditional flux PDF can be calculated by simply weighting 
the contribution of each halo by Nsyg{M). Figure 9 shows the conditional flux PDF based on 
randomly selected dark matter particles (solid line, as in Figure 6) and on the top 450 halos 
of the L50nl44 simulation, with contributions weig hted by oc M° (one galaxy per halo), 
W (X M (galaxy number proportional to halo mass), and additional cases W oc M"-^ and 
W oc M^-^. While the bias of halos with respect to dark matter remains clearly evident, the 
conditional flux PDF is almost completely independent of the relative weighting of the halos, 
and, by implication, of the galaxy HOD over the rather wide parameter range considered 
here. Similar results hold for other angular separations and spectral smoothings. Presumably 
the more massive halos do tend to reside in denser environments, but saturation removes the 
sensitivity of the flux decrement to the neutral hydrogen density once it becomes sufficiently 
high. Thus, the sign of the effect, more absorption when more massive halos are weighted 
more strongly, is in the expected direction, but the magnitude is tiny. 

At one level, the results in Figures 7-9 are disappointing, since they suggest that the 
conditional ffux PDF will not help to distinguish among competing scenarios for the origin 
of LBG properties. However, the robustness of the predictions to the details of the galaxy 
formation assumptions makes the conditional ffux PDF a good diagnostic for the inffuence 
of galaxies on the IGM. Since a wide range of reasonable galaxy bias models make similar 
predictions, a substantial departure from these predictions would most likely arise from 
galaxies directly affecting the Lyo; forest through photoionization, winds, or other processes. 
Furthermore, differences in the conditional ffux PDF for different galaxy classes — based 
on luminosity, stellar populations, or emission line profiles, for example — would probably 
reflect their differing effects on the IGM, not their different clustering properties. 
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3.4. The Conditional Mean Flux Decrement 

The dependence of the mean flux decrement on the galaxy proximity condition pro- 
vides a convenient summary of our resuhs. The mean decrement does not capture the full 
information in the PDF; we have tried, for example, scaling the optical depths of the un- 
conditional flux PDF by a constant factor to see whether we can recover the conditional 
flux PDF simply by matching its mean, and we cannot. However, measurement of the full 
conditional flux PDF requires many galaxy-quasar pairs, so measurements of the conditional 
mean decrement are likely to come much sooner (e.g., ASSP). 

Figure 10 plots the mean flux decrement as a function of maximum angular separa- 
tion ^max for galaxy populations of different space density, summarizing and reinforcing the 
results in Figures 4 and 7. The spectral smoothing length is Rg — 10 km s~^. Hexagons 
and pentagons show results for the full populations of resolved galaxies in the L22nl28 and 
L50nl44 simulations, with space densities S = 48 arcmin^^ A^;^^ and 3 arcmin^^ A^;"^, re- 
spectively. Squares and triangles show results for the 150 and 50 "brightest'' (i.e., highest 
SFR) galaxies in L50nl44. For A^^ < 6„iax = 4' (4.8 Mpc comoving) the mean decrement 
is already well above the global mean of 0.36, and it rises steadily with decreasing ^^max as the 
Lya spectra probe the denser IGM in the immediate vicinity of the galaxies, with (D) > 0.9 
for ^max = 0.25'. The mean decrement varies only weakly with SFR, with the rare, high 
SFR galaxies having slightly higher mean decrement. The x's show, for the full popula- 
tion of L50nl44, the conditional mean decrement computed in bins of angular separation 
0.86'inax < A^^ < 1.25^inax, instead of all separations A^ < 6'max- This "differential" form 
of the statistic shows a similar trend, but the increase of (D) at a given ^^ax is, of course, 
smaller, since closer separation pairs have been eliminated. 

Figure 11 plots the mean decrement against 9„iax for random dark matter particles and 
for halos weighted in various ways, as in Figures 6 and 9. The effect of bias stands out clearly 
as an offset between the matter and halo results. Results for halos are similar to those for 
galaxies in Figure 10, though the mean decrements near halos are typically slightly lower. 
The weighting of halos has only a small impact on the conditional mean decrement, with a 
modest increase in (D) for a stronger weighting of high mass halos. 

3.5. Local Photoionization 

We have so far assumed that the photoionizing background is uniform, and Croft et al. 
(2002b) show that this assumption should be adequate for most computations of Lya forest 
statistics. However, the conditional flux PDF is derived from the small fraction of pixels 
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that lie close to galaxies, and if the galaxies make a significant contribution to the ionizing 
background, then the flux decrement in these pixels may be depressed by the ionizing flux of 
their nearest galaxy neighbors. With the ionizing background predicted by Haardt & Madau 
(1996) based on the observed quasar population, these simulations (and similar ones by other 
groups) already match the observed mean decrement of the Lja forest given a baryon density 
Qb = 0.02/i~^. However, estimates of the background based on the proximity effect usually 
yield a higher intensity than predicted from the quasar population alone (Scott et al. 2000, 
and references therein), and observations of LBGs suggest that the escape fraction of ionizing 
photons is high enough to make them an important contributor to the UV background 
(Steidel, Pettini, & Adelberger 2001). Within the theoretical and observational uncertainties, 
there is probably room for galaxies to make a contribution equal to that of quasars without 
making the predicted mean flux decrement too low at 2; ~ 3 (see Schirber & Bullock 2003 
for a recent discussion of these issues) . 

The intensity of the radiation from an individual galaxy or quasar falls off as 1/r^, 
but it is further attenuated by redshifting and, more importantly for the case of ionizing 
radiation, by IGM absorption. Haardt & Madau (1996; see also Madau, Haardt, & Rees 
1999) estimate that a path length Az ~ 0.17 produces an optical depth of r(912A) ~ 1 
at z = 3, corresponding to 100 /i^^ Mpc (comoving) for our cosmology. Fardal & ShuU 
(1993) conclude that the effective attenuation length for "average" ionizing photons is ~ 2.4 
times that at 912A, so a reasonable estimate (with probably a factor of two uncertainty) is 
'"att ~ 240 Mpc (comoving). Here we will make the simple approximation that intensity 
falls as 1/r^ until Tmax = '"att ^-nd is sharply truncated beyond Tmax- If a population of galaxies 
with comoving space density n and mean ionizing luminosity (L) contributes a fraction / of 
the ionizing background intensity /, we then have 

^ J ^Trr^^rn^^ = n{L)r^^ , (4) 

implying 

J ^ ?^(-^)^max 

The intensity from a galaxy with luminosity Lj is equal to the mean background intensity 
at an "influence radius" given by 



ATrrf 



_{L) Anrl^^n 



(6) 



Note that rj decreases as either rmax or n increases, since either change makes the contribution 
of a nearby galaxy less important relative to that of the numerous, distant galaxies. 
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For / = 0.5, Li = (L), r^ax = 240/1"^ Mpc, and n = 1.2 x lO-^^/i^Mpc"^ (E = 
1 arcmin"^ A^~^), equation (6) yields rj = 0.37 Mpc (all length scales comoving). For 
our cosmological model, the corresponding angular and velocity scales are = 0.3', AV = 
47 km s~^, respectively. This simple estimate suggests that galaxy photoionization could 
have a significant impact on the conditional fiux PDF at separations of A^^ ~ 0.5', though 
its impact at A9 ~ 2' is likely to be small. The optical depth of gas at temperature T, with 
electron density rig, and at a distance r from a galaxy is reduced relative to the optical depth 
Tu of the uniform background case by a factor 

^1 -1 



where 



1 + Fr 



(7) 



1 ^ r„ + re(r)ne ^ ^ ^ 0.29(p/p)e-i-5^8myV2^^ ^ yV2)-i 

is the ratio of the total (photo + collisional) ionization rate to the uniform photoionization 
rate F^. The second part of equation (8) incorporates the Haardt & Madau (1996) value of 
Fu = 8.3 X 10~^^ s~^ and the Cen (1992) expression for collisional ionization at temperature 
T — lO^Ts K (which is also used in TreeSPH and TIPSY), assuming fully ionized gas to 
relate Ue to p/p. 

Figure 12 illustrates the impact of galaxy photoionization on the conditional mean 
decrement (left panels) and saturated fraction (right panels) for the S = 1 arcmin"^ Az~^ 
sample from the L50nl44 simulation. We compute the influence radii via equation (6) 
assuming that each galaxy's ionizing flux is proportional to its SFR and that galaxies of 
this space density collectively produce a fraction / = 0.5 of the ionizing background, with 
fainter galaxies having no signiflcant ionizing flux. Asterisks show the result of a simplifled 
calculation in which we multiply the optical depths of the extracted spectra by the factor in 
equation (7) assuming Fp = 1 and a pixel-galaxy distance r corresponding to the transverse 
separation between the galaxy and the line of sight. As expected from the order-of-magnitude 
estimate above, the suppression relative to the uniform UV background case is small at 
A^ > 1', but the mean decrement and saturated pixel fraction are noticeably reduced for 
A0 < 0.5'. Triangles show the results of a complete calculation, in which we put individual 
galaxy UV sources into the simulation before extracting the Lya forest spectra with TIPSY, 
applying equation (7) to each SPH particle with r equal to the galaxy-particle separation 
and Fr computed from the particle's temperature and density. Here the reduction of {D) 
and /sat is much smaller, and the predicted trends no longer turn over at the smallest angular 
separations. 



Why does the approximate calculation drastically overestimate the impact of galaxy 
photoionization? One possibility is that collisional ionization in the hotter, denser gas within 
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r ~ 0.5 Mpc of galaxies dilutes the effect of photoionization by making Fr <^ 1. However, 
if we repeat the TIPSY calculation with Fr = 1 for all particles, we get a nearly identical 
result for the conditional mean decrement. CoUisional ionization is important along some 
lines of sight (compare equation [8] to Figure 1), but these arc cases where dense gas produces 
heavily saturated absorption, which remains saturated even if the optical depth is reduced by 
a factor of several. Thus, overestimating photoionization by setting Fp = 1 still leaves a flux 
decrement D 1. The second possibility is that the gas producing absorption at the galaxy 
redshift is further away than the transverse separation r^, and is shifted to the galaxy redshift 
by its peculiar infall velocity. Figure 13 shows the mean line-of-sight distance, weighted by 
optical depth, of gas that produces absorption within 20 km s~^ of the galaxy redshift. 
We consider only galaxy-spectrum pairs with < 0.5', implying a transverse separation 
Tt < 0.6 h^^ Mpc (comoving); results are similar for A^^ < 0.25'. Figure 13 shows that 
absorption for these close pairs arises mainly in gas whose line-of-sight distance substantially 
exceeds the influence radius and is therefore little affected by galaxy photoionization. 
When the optical depth is high (r > 10, left-hand panel), the mean line-of-sight distance is 
usually smaller, but in these cases even a factor of several reduction in r does not move the 
flux decrement significantly below D — 1. Thus, even if galaxies contribute a large fraction 
of the ionizing background, local photoionization has little effect on the conditional mean 
decrement or saturated fraction because the real-space distance to absorbing gas is larger 
than the redshift-space distance, except for lines of sight that are heavily saturated. 

Steidel et al. (2002) find that ~ 3% of the galaxies in their LEG sample exhibit detectable 
AGN activity. Given the apparent ubiquity of supermassive black holes in local galaxies, 
it seems plausible that most LBGs go through AGN phases, with /q ~ 0.03 being the 
"duty cycle", i.e. the fraction of time that these black holes are active at 2; ~ 3. Could 
photoionization by these low luminosity AGN have a larger impact on the local IGM than 
the galaxies themselves? If the interval between periods of AGN activity is longer than the 
time 

eq 

teq = = [r„ + r,(T)ne]-^ « 4 X lO^Fp yrs (9) 

that it takes gas to return to its equilibrium neutral hydrogen fraction after being fully 
ionized,^ then we are essentially back to the uniform ionization case: the few LBGs that 
currently host AGN have large "proximity zones," but the rest have their normal complement 
of associated Lya optical depth. However, if the process of feeding gas to the central black 
hole is sufficiently stochastic, then AGN might "fficker" on and off on timescales shorter than 



•^We assume that the equihbrium state is aheady highly ionized, so that the addition of hydrogen 
ions does not significantly increase nnii or rig. The second equality follows from the equilibrium condition 
a{T)nenmi = r„n^^i + rc(T)nen^V 
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teq, with individual activity cycles lasting tact ^ fgteq ^ 10^ years. In this case, gas around 
a large fraction of LBGs could be out of ionization equilibrium, with the neutral hydrogen 
fraction depressed by the memory of the most recent AGN outburst. 

Suppose we assume that the AGN associated with the galaxies under study collectively 
produce 50% of the ionizing background. Since the time-averaged flux of this population 
equals, by assumption, the flux that we previously ascribed to the galaxies themselves, the 
photoionization rate at distance r during an active phase must exceed the previous value 
by a factor /q^, i.e., Tq = Vu{ri/rY fq^ . During an active phase, the neutral hydrogen 



density at distance r drops exponentially in time, nni ~ n^j exp(— Fq^), until it reaches a 
new equilibrium value with Lya optical depth 



However, at most radii the duration of the active phase will not be long enough to achieve 
this equilibrium. If we assume that the activity during a recombination interval ^eq occurs in 
a single "outburst" of duration tact = fgteq, then the optical depth at the end of this phase 
is 



except that it never falls below the equihbrium value of equation (10). Note that the duty 
cycle fq cancels out of equation (11) because we have fixed the time-averaged flux of the 
AGN population relative to F^. If the AGN are more luminous while they are on, then 
they must be active for less time, producing the same number of ionizing photons. We 
see from equation (11) that the scale over which recurrent AGN activity can affect the 
conditional mean flux decrement is the same influence radius that we found previously, but 
that departures from photoionization equilibrium allow the impact within this radius to be 
much stronger. 

Since we have already found that much of the absorption for small comes from gas 
beyond the influence radius, we can guess from this analysis that local AGN photoionization 
will have little impact on the conditional mean decrement. Filled circles in Figure 12 conflrm 
this expectation, showing a calculation based on equation (11), with sources embedded in 
the TIPSY spectral extraction and the same influence radii used previously. The minimum 
optical depth is given by equation (10) with /^^ = 30. While the absorption is slightly 
suppressed relative to the galaxy photoionization case, the mean decrement and saturated 
fraction continue to increase as ^max decreases. These results can be considered a conservative 
upper limit on the effects of recurrent AGN activity, since we have ignored the partial return 
to equilibrium with the uniform background that will occur following any given active cycle. 





1 



(10) 




(11) 
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If we attributed 100% of the ionizing background to the AGN associated with the observed 
LBGs, then the influence radii would be larger by 2^/^, but the impact on the conditional 
mean decrement would be only slightly stronger. 

We conclude that local photoionization by galaxies or by the AGN that they host will 
not reverse the trend of increasing Lya forest absorption with decreasing angular separation. 
Figure 13 suggests that any feedback mechanism must have a strong influence out to comov- 
ing distances ~ 1 Mpc or more if it is to effect such a reversal. We have examined simple 
models in which we completely eliminate neutral hydrogen out to a fixed radius around tar- 
get galaxies or, alternatively, reduce the neutral fraction within this radius by a factor of 
three. If we eliminate all neutral hydrogen to r = 1 h^^ Mpc, then the conditional mean 
decrement drops to {D) 0.6 for ^^ax = 0.25' and 0.5', and the saturated pixel fraction falls 
to /sat ~ 0.4. However, a factor of three reduction out to 1 Mpc, or complete ehmina- 
tion out to 0.5 Mpc, has only small impact, comparable to that of the AGN ionization 
model shown in Figure 12. Elimination of neutral hydrogen out to 0.75 /i"-*^ Mpc has an 
intermediate effect, reducing {D) to ~ 0.75 and /sat to ~ 0.5. 



3.6. Redshift Errors 

We showed in Figure 5 that the signature of the overdense environments of galaxies in 
the conditional flux PDF is substantially weakened if one looks at pixels ~ 200 km s~^ away 
from the galaxy redshift instead of at the galaxy redshift itself. (The angular separation cor- 
responding to Hr — 200 km s~^ is 1.3'.) Since the redshifts of LBGs are usually estimated 
from nebular emission or absorption lines, which could be offset relative to the mean sys- 
temic velocities, this result suggests that redshift measurement errors could have a significant 
impact on practical determinations of the conditional flux PDF. To address this point quan- 
titatively, we show in Figure 14 the conditional mean decrement and saturated fraction when 
pixel redshifts are drawn from a Gaussian of mean zero and dispersion (Taf = 150 km s~^ 
(squares) or 300 km s~^ (triangles). We use the E = 1 arcmin"^ Az"^ sample of the L50nl44 
simulation, with no galaxy photoionization. Redshift errors of ctav = 150 km s~^, similar to 
those estimated by ASSP, depress (D) and /sat by ~ 0.05 at ^max = 2' and by ~ 0.1 — 0.2 
at ^max < Errors of (Tav = 300 km s^^ depress (D) and /sat more severely, and they 
largely (though not entirely) remove the trend of increasing absorption with decreasing ^max- 
Redshift errors also reduce the already small differences between models with and without 
galaxy photoionization, since the absorption is now usually measured at a pixel further away 
from the galaxy. 



4. Lcirge Scale Correlations 



Inspired by the observational work of ASSP and ASPS, we also examine a statistical 
measure that characterizes LBG-Lya forest correlations on larger scales than those examined 
in §3. We divide each of the 1200 random spectra through the L50nl44 simulation into L/ S 
segments of comoving length where L = 50 Mpc is the box size. For each segment i, 
we compute the mean flux decrement Ds,i along the segment, and we count the number of 
galaxies Ni in a cube of side length S centered on the spectral segment, making use of the 
simulation's periodic boundaries as necessary. We include galaxy peculiar velocities when 
computing positions along the line of sight, again utilizing the periodic boundaries when 
needed. We then consider all segments in a bin oi Dg values and compute the mean value 
of the galaxy density contrast 5gai,j = Ni/N — 1, where N = S'^n is the average number of 
galaxies in a cubic cell of size S. We use the full resolved galaxy population of the simulation, 
for which N — 3.6{S/10h~^ Mpc)^. Poisson noise fluctuations in 5gai are large, and if one 
plots the mean value of Ds in bins of 5gai, then a "Malmquist" type bias effectively erases any 
trend (there are more cells of intrinsically low density contrast that scatter to high 5gai than 
vice versa). However, Ds is not a noisy quantity, and if one has many cells, then Poisson 
fluctuations in Sgai average to zero without bias. A plot of Sgax in bins of Ds therefore reveals 
the preferential location of galaxies in overdense environments. 

The solid points in the upper panels of Figure 15 show Sg^i vs. Ds for 5" = 10 Mpc 
(left) and 5 Mpc (right). Error bars represent the la error on the mean, i.e., the disper- 
sion of the Nseg values of 5gai in the Ds bin divided by (iVseg — 1)^^^. Smooth curves show 
the cumulative distribution of Ds values. Only a small number of segments have Ds > 0.6 
for S* = 10 h^^ Mpc, so the error bars on 6gsi become large for high Ds values. Nonetheless, 
there is a clear, strong trend of increasing 5gai with increasing Ds up to Ds ~ 0.65, where 
the mean galaxy overdensity is 1 + 5gai ~ 2.5. For S — 5h~^ Mpc, there are more high 
decrement segments and the trend of 5gai with Ds continues to Ds ~ 1. 

The open points in Figure 15 show the results using dark matter particles instead of 
galaxies. While there is a steady trend of with Ds, the bias of the galaxy population 
is strikingly evident in the much steeper slope of the Sg^i — Ds trend. The suppression of 
high redshift galaxy formation in underdense regions is especially clear, with mean density 
contrasts 5gai approaching —1 for Ds < 0.2. In the S — 5h~^ Mpc plot, the physical 
requirement that 5 > —1 forces the galaxy correlation to be concave upward at low Ds- 
Peculiar velocities make the Sdm — Ds correlation significantly steeper in redshift space than 
in real space, as coherent flows amplify density contrasts (Sargent & Turner 1977; Kaiser 
1987). However, the 5gai — Ds correlation is minimally affected by peculiar velocities owing 
to the high bias (and correspondingly low /b) of the galaxy population. 
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The rms fluctuations of Ds and density contrast are (ct/j, (7dm, Cgai) = (0.13,0.35, 1.01) 

for S = 10 Mpc and (0.17,0.56,1.93) for S — 5h~^ Mpc. Here we have subtracted a 
shot noise contribution to the galaxy dispersion assuming Poisson statistics, i.e., we define 
"^gai = [{{^ ~ ^)^) — /N, where (...) represents an average over aU ceUs and the 

division by N converts from a dispersion of iV to a dispersion of 5gai. We also subtract 
shot noise when computing adm, but since the number of dark matter particles is large, the 
correction is negligible. 

The ratio of correlation slopes between Sgsi — Ds and Sdm — Ds is roughly the ratio of (jgai 
to (Jdia- as we show in the bottom panels of Figure 15, which plot 5gai/o"gai or 5dm/o'dm against 
{Ds — D)/(ju-, where D = 0.36 is the unconditional mean decrement. In these normalized 
variables, the galaxy and dark matter correlations are similar, though the constraint 5 > —1 
forces the galaxy correlation to become shallower at low Ds, while the dark matter trend 
remains linear down to essentially zero flux decrement. Thus, linear bias appears to be a 
good but not perfect approximation for this statistic. Our results for dark matter agree 
well with those of McDonald et al. (2002), who examine the unnormalized and normalized 
correlations of Sdm with Ds using the hydro-pm approximation of Gnedin & Hui (1998). 
For 10 h^^ Mpc cubes, they find a correlation with a slope of 0.6 in normalized variables, 
bending towards shallower slopes for {Dio — D)/aD ^ 1 (^f ^ 1 in their notation). The 
box diagonals of the lower panels of Figure 15 have a slope of 0.6, so one can see by visual 
inspection that we obtain a similar slope and turnover, and that the slope for bh"^ Mpc 
is nearly the same as for 10 h"^ Mpc. McDonald et al. (2002) show that variations in the 
Lya forest assumptions — e.g., the temperature-density relation, the mean decrement, or 
the flux power spectrum — have only a small effect on the (5dm — Ds correlation when it is 
plotted in normalized variables, so we expect the same would hold true for the (5gai — Ds 
correlations shown here. 

As a quantitative measure of correlation strength, we compute the cross-correlation 

coefficients rgai,D = {^^^x^d) /{(yg^ifyo) and rdm,D = (5dm5z))/(o-dmO-D), where 6d = Ds - D, 
and the averages are over all cells/segments. At 10 h^^ Mpc, the correlation coefficients are 
0.49 and 0.66 for galaxies and dark matter, respectively. The coefficients for dark matter 
are substantially higher in redshift space than they would be in real space, while for galaxies 
they are nearly identical in redshift space and real space. The cross-correlation between 
galaxies and dark matter is high, rgai,dm ~ 0.95, and would again be lower in real space. 
However, rgai,D is lower than rdm,D for both cube sizes, perhaps because the curvature of 
the (5gai — Sdm relation at low 5d suppresses the contribution of these cells to the correlation 
coefficient. As emphasized by ASPS, the correlation rdm,D is necessarily below unity because 
the flux decrement Ds gives a (non-linearly) weighted average of the density along the line 
of sight through the cube center, and this quantity is imperfectly correlated with the mean 
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density of the cube as a whole. 

Figure 16 resembles Figure 15, but here the S — Dg correlation for galaxies is compared 
to that for the 450 most massive friends-of-friends halos, with the same weighting schemes 
used in Figures 8 and 9. The correlation is insensitive to halo weighting except at high Ds, 
since massive halos are rare in lower density regions and assigning them higher weights makes 
little difference. At high Ds^ the contrast is usually stronger when massive halos receive more 
weight, as one would expect, but our results in this regime are fairly noisy. The rms density 
contrast a is sensitive to the relative halo weighting, so in normalized variables (bottom 
panels) the correlation is actually shallower when massive halos are more strongly weighted, 
and correlation coefficients are correspondingly lower. The galaxy correlations most closely 
resemble those for equally weighted halos or halos with weight oc M°-^, and both the rms 
fluctuations of galaxies and their correlations with Lya flux decrement are lower than they 
would be if the number of galaxies were proportional to halo mass. 



5. Discussion 

The conditional flux PDF considered in §3 and the (5gai — Ds correlation considered in 
§4 offer empirical tools with which to test the prediction that high redshift galaxy formation 
is strongly biased, and to search for the influence of star-forming galaxies on the surround- 
ing IGM. The predicted flux PDF shifts systematically towards higher absorption in pixels 
close to galaxies — the smaller the angular separation, the higher the mean decrement and 
saturated fraction and the lower the transparent fraction. The shift of the flux PDF remains 
strong in spectra smoothed over 50, 100, or 200 km s^^, allowing further tests of the predicted 
scale of correlations and demonstrating that moderate resolution spectra of relatively faint 
AGN may profitably be used in such investigations. The influence of galaxy proximity is 
much stronger than that of random mass elements (dark matter particles) , a clear signature 
of the preferential formation of galaxies in overdense environments. 

The conditional flux PDF is only weakly sensitive to the baryonic mass or SFR of the 
sample galaxies over the range that we have investigated, spanning more than two orders 
of magnitude in space density. It is also insensitive to the occupation distribution of dark 
matter halos. While this insensitivity reduces the utihty of this statistic as a diagnostic of 
galaxy formation, it increases its power as a diagnostic of galaxy feedback, since the no- 
feedback prediction is robust. We have investigated the potential impact of one feedback 
process, galaxy photoionization, and flnd that it has only a small impact on the conditional 
mean flux decrement, even if the bright galaxies used in the conditional PDF measurement 
contribute a large fraction of the ionizing background. The local flux from these galaxies is 
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comparable to the uniform background at "influence radii" n ~ OAh"^ Mpc (comoving), 
corresponding to angular separations ~ 0.3', and in the absence of pecuhar velocities 
this flux would drive down the average absorption at smaller separations. However, the gas 
producing absorption at the galaxy redshift is typically > 1 Mpc away along the line 
of sight, where it is minimally affected. Recurrent AGN activity, with associated depar- 
tures from photoionization equilibrium, can have a stronger impact within r^, but it does 
not substantially increase the typical influence radius, so it still has httle effect on the ffux 
decrement close to galaxies. The predicted dependence of (D) on is sensitive to red- 
shift measurement errors, with rms errors aAv ^ 150 km s~^ signiflcantly depressing the 
absorption signature. 

There is signiflcant overlap between our study of the conditional flux PDF and the recent 
paper by Croft et al. (2002a), who independently investigate the conditional mean decre- 
ment statistic and other measures of LBG-Lya forest correlations, using similar numerical 
techniques. As discussed in §3, Croft et al. (2002a) and ASSP define the conditional mean 
decrement in a slightly different way from us, averaging over all galaxy- pixel pairs in bins of 
redshift-space separation A^ instead of considering only pixels at the galaxy redshift. Aver- 
aging in redshift-space separation bins instead of angular separation bins at AV = (like 
the "differential" points of Figures 10 and 14) has a small but noticeable impact, yielding 
shghtly less absorption at a given comoving distance. To facilitate comparison to the Croft 
et al. (2002a) and ASSP results, we have calculated their version of the mean decrement 
statistic for the 150 highest SFR galaxies in the L50nl44 simulation (E = 1 arcmin"^ A^;"^). 
Results are shown in Figure 17, assuming rms redshift determination errors of 0, 150, and 
300kms~^ (triangles, squares, and pentagons, respectively). 

Croft et al. (2002a) adopt slightly different cosmological parameters, with = 0.3, 
as — 0.9 instead of — 0.4, as — 0.8. Relative to our L50nl44 run, their impressive, 
2 X 300^ particle simulation has a volume 3.3 times smaller but mass resolution 30 times 
higher. Solid and dotted curves in the left hand panel of Figure 17 show their results for 
galaxies with Mb > lO^^M© and Mb > IO^^Mq (Croft et al., fig. 8), kindly provided by 
R. Croft. Given the differences in cosmological and numerical parameters, the two inde- 
pendent calculations agree remarkably well. The comoving space densities of the two Croft 
et al. samples, 0.028 and 0.0008 /i^Mpc"^, correspond to E = 24 arcmin"^ Az"^ and E = 
0.67 arcmin""^ A^;"^ for our cosmology, so they are comparable to our E = 48 arcmin"^ A^;"^ 
and E = 1 arcmin"^ Az~^ samples, respectively. As shown in Figure 10, we find no signifi- 
cant difference in the conditional mean decrement for these two samples, while Croft et al. 
find stronger absorption for the more massive galaxies. Since their Mb > IO^^Mq sample 
comprises only 30 galaxies, this difference in the results does not seem too serious at present. 
Croft et al. also consider higher density samples, with mass resolution thresholds below the 
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resolution limit of our L22nl28 simulation, and they find a continuing though modest trend 
of decreasing conditional mean decrement with decreasing galaxy mass threshold. In agree- 
ment with Croft et al., we find that redshift determination errors of cr^y ~ 150 — 300 km s^^ 
have a substantial impact on the predicted trend of (D) with galaxy separation, and that 
galaxy photoionization does not have a large impact. 

ASSP have investigated LBG-Lya forest correlations observationally, carrying out an 
LBG spectroscopic survey in fields around six quasar lines of sight. Their estimates of the 
conditional mean decrement, kindly provided by K. Adelberger, arc shown by the filled 
points in Figure 17. These arc somewhat different from the points shown by Croft et al. 
(2002a) because of changes to ASSP's analysis procedures, in particular a scaling of optical 
depths with redshift that should make the results more directly comparable to predictions hke 
the ones presented here. The predicted and observed trends agree reasonably well down to 
comoving separations ~ 3 Mpc, with a smooth increase of ~ 0.1 in (D). However, ASSP's 
trend flattens towards smaller separations, while our predicted trend rises and steepens. 
Most strikingly, ASSP find a decrease in {D) at separations r < 1 Mpc, where the 
predicted absorption is strongest. If the rms redshift errors are (Jav ~ 300 km s~^ rather 
than the ~ 150 km s^^ that ASSP estimate, then the discrepancy between the predictions 
and observations is reduced, but the two data points at r < 1 Mpc remain in serious 
conflict. 

ASSP suggest that the flattening and eventual decline of the mean decrement in close 
proximity to LBGs is a signature of galaxy feedback on the surrounding IGM, perhaps caused 
by the galactic scale winds suggested by the nebular line proflles of LBGs (Pettini et al. 2001, 
2002). Croft et al. (2002a) consider a variety of galaxy feedback models and show that they 
can roughly reproduce the ASSP mean decrement results if ~ 10% of the available supernova 
energy is converted to kinetic energy of a coherent, galactic scale outflow that sweeps up all 
of the IGM in its path. While this wind model is ad hoc and idealized, it does show that it 
is energetically possible for supernova feedback to influence the conditional mean decrement 
out distances of a few hr^ Mpc (comoving). The range of the effect is larger than that of 
individual wind bubbles because the galaxies themselves are correlated, and a line of sight 
~ 2 Mpc from one galaxy may pass through the wind-blown bubble of another. We 
have shown (in agreement with Croft et al. 2002a and ASSP) that galaxy photoionization 
cannot explain the observed downturn in absorption, and we have shown that this remains 
the case even if one considers departures from ionization equilibrium that could be caused 
by recurrent AGN activity. 

The results of §3.5 (Figure 13 in particular) have an important implication for wind 
models: peculiar velocities allow gas at comoving distances 1 — 2 Mpc to produce strong 
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absorption at redshift-space separations — 0.5h~^ Mpc, so the required wind radius is 
larger than one would first guess from the scale of the observed downturn. Figure 18 demon- 
strates this point explicitly, using the E = 3 arcmin"^ Az~^ sample of the L22nl28 sample 
(the top 40 galaxies in the box). Squares, pentagons, and hexagons show the conditional 
mean decrement computed after eliminating all neutral hydrogen within a comoving radius 
Rw = 0.75, 1.0, or 1.5 Mpc around each of the target galaxies (see KoUmeier et al. 2002 
for further discussion). Only the Ryj — 1.5 Mpc case produces a downturn comparable 
to the observed one, and even it does not reproduce ASSP's innermost data point (as we 
discuss further below). Stars show a case in which we put winds around all 641 resolved 
galaxies in the box and scale the wind volume in proportion to each galaxy's baryon mass, 
Ryj oc M^^^ . This model has R^ ~ lh~^ Mpc for the 40th-ranked galaxy in the box and 
a maximum R^j ^ 2 h^^ Mpc. The impact is similar to, though slightly weaker than, that 
of the constant-radius R^ = 1.5 h^^ Mpc model. All of these models are, of course, highly 
simplified, but they show that galactic winds must efficiently eliminate neutral hydrogen 
out to large distances to explain ASSP's results. The energetic requirements for such winds 
are not impossible to meet (see Croft et al. 2002a), but they do require sustained outflow 
speeds of several hundred km s~^ for i > 1 Gyr and effective entrainment of surrounding 
gas. Winds with large filling factor can have a noticeable impact on the fiux power spectrum 
of the Lya forest, though for Rw < 1 hr^ Mpc the effect is small (Weinberg et al. 2003). 

The innermost ASSP data point is based on only three galaxy-QSO pairs, and the error 
estimation procedure is somewhat ad hoc (see ASSP), so it is natural to ask whether the 
discrepancy with the data is statistically significant. To address this point, we randomly 
selected 3-tuples of simulated galaxy-sightline pairs with < 0.4' and computed the mean 
decrement in the redshift separation bin A,. = — 0.5 /i^^ Mpc, repeating the experiment 
500 times. Figure 19 shows the cumulative distribution of these 3-tuple mean decrements, 
for our standard model with no redshift measurement errors (solid line), for redshift errors 
(Tav = 300 km s^^ (dashed line), and for the i?^ = 1.5 h^^ Mpc "wind" model from Figure 18. 
Vertical lines mark the unconditional mean flux decrement, {D) = 0.36, and the ASSP 
measurement for this separation, {D) — 0.11. With itav = 300km s~^, the 3-tuple mean 
decrement is below 0.36 only ~ 5% of the time, and none of our 500 trials have a mean 
decrement as low as the ASSP value. Even for the i?^ = 1.5 Mpc wind model, only 
~ 5% of the trials have a 3-tuple mean decrement as low as ASSP's. We did not incorporate 
redshift errors in the wind model, but (in contrast to the no-wind case) they tend to raise the 
predicted mean decrement by moving the measurement point further from the influence of 
the galaxy. We have only 150 simulated galaxies contributing to the solid and dotted curves 
and only 40 contributing to the dashed curve, so our 3-tuples are not independent and 
we may therefore underestimate the incidence of rare, low absorption cases. Nonetheless, 
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it appears that the ASSP results cannot easily be explained away by appealing to small 
number statistics. 

Given the stringent demands on wind models, it is worth considering alternative ex- 
planations. One possibility is that the IGM at these distances from galaxies is multi-phase, 
with processes that the simulations cannot resolve causing the geometrical cross section of 
the neutral phase to drop sharply. However, the typical physical conditions at these distances 
from galaxies are not extreme (see Figure 1), so it would be difficult to reconcile such an 
explanation with the overall success of simulations in reproducing (unconditional) statistical 
properties of the Lya forest. 

A final possibility worth considering is that extended Lya emission from the target 
galaxies "fills in" the Lya forest at the redshift of emission. Extended Lya "blobs" have 
been observed to be associated with LBG's (Steidel et al. 2000) having angular extents 
~ 15" and AB apparent magnitudes 21.02 and 21.14 in an 80A Lya band. Cooling radiation 
from gas settling into massive galaxies z — 3 naturally produces Lya flux of this order, 
and numerical simulations predict that such blobs should be present around typical LBGs 
(Fardal et al. 2001). Thus, a galaxy's Lya cooling radiation could plausibly replace the 
absorbed continuum flux of a 2 1st- magnitude background quasar. There are two substantial 
and partly cancelling corrections to this estimate. First, the Lya emission extends over 
~ 1000 km s^-*^ (16 A) rather than the 80A bandpass used by Steidel et al. (2000), so the fiux 
density at the Lya wavelength is a factor of five higher (—1.75 mag). Second, at a separation 
A^^ ~ 15 — 20" from a target galaxy, the ~ 0.8" sHts used by ASSP should intercept only 
~ 0.8/2A^? ~ 0.02 — 0.03 of the galaxy's extended Lya fiux (~ +4 mag), assuming constant 
surface brightness out to A^. The three quasars that contribute to ASSP's innermost data 
point have G-band AB magnitudes of 20.1, 21.6, and 23.4, so with the -1-2.25 mag net 
correction, it appears that Lya emission could replace the absorbed quasar flux only for the 
faintest of the three quasar targets, unless the galaxies in question are even brighter than 
the Steidel et al. (2000) blobs. Furthermore, a fourth pair involving the G = 17.8 quasar 
Q0302-0019 shows no sign of absorption near the galaxy redshift, and in this case the quasar 
is clearly too bright for galaxy emission to compete with it. (This pair and two others are 
dropped from ASSP's (D) calculation because of possible Ly/3 contamination.) At this point, 
therefore, the Lya emission explanation seems unlikely, but future observations that have 
more close pairs involving bright quasars will be able to test it conclusively. 

On the larger scales probed by the Sgsi — Ds correlation, our predictions are in qualitative 
agreement with the observational results reported by ASSP and ASPS. As the Fluctuating 
Gunn-Peterson Approximation suggests, higher mean flux decrements tend to arise in cells 
of higher mass density, and these in turn have higher average galaxy overdensity. The limited 
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size of our largest volume simulation, a single 50 Mpc cube, leaves statistical and system- 
atic uncertainties in our predictions. Nonetheless, our results for the S^m — Ds correlation 
at 5" = 10 Mpc are similar to those found by McDonald et al. (2002) using hydro-PM 
simulations (with 40 h^^ Mpc cubes but multiple realizations, and significantly higher mass 
resolution). The 5gai — Ds correlation is much steeper than the 6dm — Ds correlation because 
of the strong bias of the high redshift galaxy population, and the suppression of galaxy 
formation in underdense environments is especially clear. The ratio of correlation slopes 
is approximately equal to the ratio of rms fluctuation amplitudes, CTgai/cTdm, but a "linear 
bias" model does not fully capture the behavior of the 5gai — Ds correlation, in particular 
the curvature at low Ds imposed by the constraint that Sg^x > — 1. 

Our 50 Mpc simulation predicts a correlation coefficient rgai,D = 0.49 between galaxy 
density contrast and averaged flux decrement in 5h~^ Mpc or 10 Mpc cells. This is 
smaller than the value r = 0.68±0.06 that ASPS measure for S 10 h^^ Mpc, with a la error 
bar derived from the dispersion among five lines of sight. However, we cannot assign a reliable 
"theoretical error bar" to our predicted rgai,D without modeling ASPS's procedures in greater 
detail. Even for an ideal, volume-limited sample, there are subtle issues associated with shot 
noise subtraction and estimator biases (Hui & Sheth, in preparation), and corrections for 
redshift selection functions or sample incompleteness could also have a significant impact 
on estimates of r and its uncertainty. Accurate assessment of the discrepancy between 
predicted and observed correlation coefficients probably requires larger simulation volumes, 
which would allow creation of multiple independent "surveys" comparable to the observed 
one. Fortunately, we find that the S—Ds correlations for galaxies are similar to those for halos 
of the same space density, so it should be possible to carry out such assessments using large N- 
body simulations and the PM or hydro-PM approximation (Croft ct al. 1998; Gnedin & Hui 
1998) for the Lya forest. The absolute trend (i.e., not normalized by standard deviations) 
of 5gai with Ds is the best test for the strong bias of high redshift galaxies predicted by 
theoretical models. 

As the size of LEG spectroscopic samples grows, moderate resolution spectra of fainter 
quasars can greatly increase the number of lines of sight available for measuring LBG-Lya 
forest correlations. These larger samples will shrink the error bars of the existing mea- 
surements, and they will eventually allow measurement of the full conditional flux PDF for 
different subsets of the LEG population. The potential impact of redshift determination 
errors can be investigated empirically by comparing results for galaxies whose redshifts are 
estimated by different methods (e.g., emission vs. absorption lines). Models with galactic 
winds or other forms of feedback can be tested using the full PDF rather than the mean 
decrement alone. Variations in feedback from one galaxy to another should widen the range 
of flux decrement values in addition to reducing the mean, and winds that sweep up the 
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IGM should produce many lines of sight with essentially zero flux decrement. Perhaps the 
best diagnostic of galaxy feedback will be separate measurements for galaxy subsets with 
high and low star formation rates, or older and younger stellar populations, since one would 
expect the young/high SFR galaxies to have had more vigorous activity in the recent past. 
Systematic differences in the activity of different galaxy populations should produce sys- 
tematic differences in the corresponding conditional flux PDFs, and our results show that 
such differences would be unlikely to arise from differences in the spatial clustering of these 
populations. A full accounting of the relation between LBGs and the Lya forest will test 
our basic expectations for the way that dark matter shapes the large scale distribution of 
high redshift galaxies, and it will inform our picture of the ways that galaxies can affect the 
high redshift IGM. 
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Table 1. Simulation Parameters^ 



Name 


Lbox ( h-^ Mpc) 


N 


Cgrav ( h ^ kpc) 




64m,ph (Mq) 


Llln64 


11.11 


643 


3.5 


4.9 X 10^° 


6.7 X 10^ 


Lllnl28 


11.11 


1283 


1.75 


6.2 X 10^ 


8.3 X 10^ 


L22nl28 


22.22 


1283 


3.5 


4.9 X 10^° 


6.7 X 10^ 


L50nl44 


50.00 


1443 


7 


4.0 X 10^^ 


5.4 X 10^° 



^ Length units are comoving. In all cases, the adopted cosmology is = 0.4, Q\ — 0.6, 
h = 0.65, ftb = 0.02/i-^ n = 0.95, erg = 0.80, and the output redshift is z = 3. The 
angular size, velocity depth, and redshift depth of the 22.222 Mpc cube are 18.74', 
2844 km s^^. and 0.0379, and they scale in proportion to Lbox for the other simulations. Cgrav 
is the gravitational softening length (equivalent Plummer softening), setting the approximate 
spatial resolution. 
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Fig. 1. — Neutral hydrogen fraction, neutral hydrogen density, gas overdensity, and gas 
temperature (top to bottom) as a function of comoving transverse separation from galaxies, 
computed using all resolved galaxies in the L22nl28 simulation. Filled circles show median 
values for pixels at the galaxy redshift and this transverse separation, while the inner and 
outer edges of the vertical segments mark the 25-75% and 10-90% ranges, respectively. In 
the gas density panel, x 's show the mean overdensity, which is always higher than the median 
because of the skewed distribution of Pg/pg- For our cosmology, a transverse separation of 
1 Mpc corresponds to 0.843'. Scales along the top and bottom axes are marked in 
arc-minutes and comoving Mpc, respectively. 



-37- 



10 



0.1 




□-■ 
A- 



Random 

A0<2' 
Ae<0.5' 



0.01 



10 



0.2 0.4 0.6 0. 
D 



0.1 



0.01 




0.2 0.4 0.6 



Fig. 2. — Mass resolution effects on the conditional and unconditional flux PDFs, for spectra 
smoothed over 10km s~^ (left) or 50km s~^ (right). Heavy curves and symbols are computed 
for galaxies with baryonic mass M > 512mspH in the Lllnl28 simulation, while light curves 
and symbols are computed for galaxies with M > 64mspH in the Llln64 simulation. If 
there were no resolution effects at all on the galaxy populations or the spectra, then the 
results would agree perfectly. Solid curves show the unconditional PDF, computed using all 
pixels along random lines of sight through the simulation box. Dotted and dashed curves 
show P{D) for pixels at the same redshift as a galaxy with transverse separation < 2' or 
A6' < 0.5', respectively. Symbols along the top, right, and left axes show, respectively, the 
mean flux decrement, the fraction of "saturated" pixels (with D > 0.9), and the fraction of 
"transparent" pixels (with D < 0.55), computed from the corresponding PDF. 



-38- 




10 



0.01 




0.1 



0.01 



- □ ' 

□ / A/ 

- ^ C^'ll^ndom 

qt Ae<2' 
A - Ae<0.5' 

J ; I III' I ■ I 

0.2 0.4 0.6 0. 
D 



8 



Fig. 3. — Comparison of tlie unconditional and conditional flux PDFs from the L22nl28 
and L50nl44 simulations, in the same format as Fig. 2. Heavy curves and symbols show- 
results using the 37 galaxies with M > 512mspH in L22nl28, and hght curves and symbols 
show results using the 450 galaxies with M > 64mspH (the same physical mass threshold) 
in L50nl44. Differences can arise from systematic effects of resolution, box size, or random 
differences in the structures present in the two simulations. In the right hand panel, the 
heavy triangle representing the saturated fraction of L22nl28 is superposed on the light 
triangle, and the heavy triangle representing the transparent fraction is off the bottom of 
the plot. 
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Fig. 4. — The unconditional and conditional flux PDFs computed using all resolved galaxies 
(M > 64mspH) in the L22nl28 simulation. The four panels show results for spectral smooth- 
ing lengths of 10, 50, 100, and 200km s~^ (Gaussian la width), respectively. The format is 
similar to Figs. 2 and 3: solid curves show P{D) for all pixels in random lines of sight, while 
dot-dashed, dotted, and dashed curves show P{D) for pixels at the same rcdshift as galaxies 
with transverse separations A^^ < 2', 1', and 0.5', respectively. Symbols along the top, right, 
and left axes show the mean decrement, saturated fraction, and transparent fraction. 
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Fig. 5. — Dependence of the conditional flux PDF on velocity offset AV, computed using all 
resolved galaxies in the L22nf 28 simulation. Top and bottom panels show angular proximity 
conditions A^^ < 2' and A6' < 0.5', respectively, with spectral smoothing of 10 km s^^ (left) 
or 50km s^^ (right). In each panel, solid curves show the unconditional flux PDF, and 
long-dashed curves show the conditional PDF for pixels at the same redshift as a galaxy, 
as in Fig. 4. Dashed, dotted, and dot-dashed curves show conditional PDFs in pixels offset 
from the galaxy systemic velocity by |Ay| = 100, 200, or 400km s~^, respectively. Symbols 
show the corresponding mean decrements, saturated fractions, and transparent fractions. 
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Fig. 6. — The impact of galaxy bias on the conditional flux PDF, for spectral smoothing 
lengths of 10 km s~^ (left) and 50km s~^ (right). Light solid, dot-dashed, and dashed curves, 
repeated from the corresponding panels of Fig. 4, show the unconditional fiux PDF and the 
conditional flux PDFs for A^^ < 2' and A^^ < 0.5' about resolved galaxies in the L22nl28 
simulation. Heavy curves show the corresponding conditional flux PDFs using randomly 
selected dark matter particles in place of galaxies. Light and heavy symbols are based on the 
corresponding flux PDFs. While the conditional flux PDF about randomly selected mass 
elements differs from the unconditional flux PDF (which is effectively volume weighted), 
the deviations of the galaxy conditional PDFs are much stronger, a clear signature of the 
preferential formation of high redshift galaxies in overdense large scale environments. 
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Fig. 7. — Dependence of the conditional flux PDF on galaxy mass and SFR. In each panel, 
heavy solid lines show the conditional PDF based on all resolved galaxies in the L22nl28 
simulation, as in Fig. 4. The baryonic mass threshold is M > 6.7 x lO^M©, and the space 
density of the population is S = 48 arcmin^^ A^;"^. Light solid lines show results for the 
resolved galaxy population of L50nl44, with a mass threshold of 5.4 x IO^^Mq and a space 
density of E = 3 arcmin"^ Az"-*^. Dotted and dashed lines are for galaxies in L50nl44 with 
SFR > lO8M0yr~^ and SFR > 18OM0yr~^, respectively, with population space densities 
E = 1 arcmin^^ A^;"^ and E = 0.33 arcmin"^ A^;"^. 



-43- 




Fig. 8. — Comparison of halo and galaxy conditional flux PDFs. Light hnes and symbols, 
repeated from the corresponding panels of Fig. 7, show results for the resolved galaxies in 
L50nl44, with E = 3 arcmin"^ Az~^. Heavy lines and symbols show results when these 
galaxies arc replaced by friends-of-friends halos identified from the simulation's dark matter 
particle distribution. Halos are selected above a (dark matter) mass threshold of 5.7 x 
10^^ Mq, yielding 450 halos in the simulation volume and E = 3arcmin~^ Az~^. 
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Fig. 9. — Influence of halo weighting on the conditional flux PDF. Dot-dashed hnes, repeated 
from Fig. 8, show results in which each halo above the mass threshold contributes equally. 
Solid lines and filled symbols show results for randomly selected dark matter particles. Other 
hnes show results in which the contribution of each halo-pixel pair to the conditional PDF 
is weighted by M", with a = 0.5 (dotted), a = 1 (short-dashed), or a = 1.5 (long-dashcd). 
While the overall bias of halos makes the conditional PDF of halos substantially different 
from that of the mass, stronger weighting of high mass halos has only a small impact on this 
statistic. 
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Fig. 10. — The conditional mean flux decrement, for spectral smoothing of 10 km s~^. Open 
symbols show the mean decrement of pixels at the redshift of a galaxy with angular separation 
< ^max arc-minutes. Results are shown for the resolved galaxy population of L22nl28 
(E = 48 arcmin"^ Az"^), the resolved galaxy population of L50nl44 (E = 3 arcmin"^ 
and SFR-thresholded samples of the L50nl44 population with E = 1 arcmin"^ Az~^ and E = 
0.33 arcmin"^ Az~^. The x's show a "differential" form of this statistic, with {D) computed 
for separations in the range O.S^max < A^ < 1.25^max, for the S = 3 arcmin"^ A^;"^ case only. 
The horizontal dotted line segment marks the unconditional mean decrement, {D) = 0.36. 
For our cosmology, 1' corresponds to 1.19 Mpc (comoving) and Hrt = 152 km s~^ at 
z = 3. 
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Fig. 11. — The conditional mean flux decrement (cumulative form) for randomly selected 
dark matter particles (filled symbols) and for dark matter halos with different relative weight- 
ings (open symbols, with weights as marked in the legend), computed from the L50nl44 
simulation. The dotted horizontal line segment shows the unconditional mean decrement 
(D) = 0.36. 
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Fig. 12. — Potential impact of galaxy photoionization on the conditional mean flux decrement 
(left) and the saturated pixel fraction (right), for the E = 1 arcmin"^ A^;"^ sample of the 
L50nl44 simulation and a spectral smoothing of 10 km s~^. We assume that these galaxies 
produce a fraction / = 0.5 of the UV background at z — 3, with each galaxy's photoionizing 
flux proportional to its SFR. Pentagons show results in the absence of galaxy photoionization, 
asterisks show the results of the approximate calculation described in the text, and triangles 
show the results of a complete calculation with the UV sources embedded in the simulation 
before spectra are extracted. The complete calculation yields a much weaker effect because 
peculiar velocities allow relatively distant gas to produce absorption at the galaxy redshift 
(see Fig. 13). Filled circles show results for the recurrent AGN model described in the text, 
for ^max = 0.25' and ^max = 0.5' only. Dotted horizontal segments show the mean decrement 
(D) — 0.36 or saturated fraction /gat = 0.086 for the unconditional flux PDF. 
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Fig. 13. — Distribution of the mean line-of-sight distance, weighted by optical depth, of 
gas producing absorption within 20 km s~^ of the galaxy redshift, calculated for lines of 
sight within 0.5' of galaxies in the S = 1 arcmin"^ A^;"^ sample of the L50nl44 simulation, 
assuming a uniform ionizing background. Left, middle, and right panels show three different 
ranges of the optical depth at the galaxy redshift, with the number of lines of sight as 
indicated. In each panel, histograms show the differential distribution and dotted lines the 
cumulative distribution (on a scale where the top of the panel is unity). Dashed lines show 
the cumulative distribution of photoionization influence radii for the 150 galaxies. Units 
are comoving Mpc. 
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Fig. 14. — Impact of redshift determination errors on the conditional mean decrement (left) 
and saturated pixel fraction (right). Upper panels show our standard, cumulative form of 
these statistics, computed for all galaxy-pixel pairs with angular separation A9 < 6'max, and 
lower panels show the differential form, computed for pairs with O.S^max < < 1.25^niax- 
In each panel, pentagons show results for the E = 1 arcmin"^ Az"^ sample of the L50nl44 
galaxy population in the absence of redshift errors. Squares and triangles show the corre- 
sponding results when estimated galaxy redshifts are drawn from a Gaussian distribution of 
la dispersion 150 km s~^ or 300 km s~^, respectively, centered on the true galaxy redshifts. 
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Fig. 15. — Large scale correlations of galaxy and dark matter overdensity with Lya flux 
decrement, in cubes of comoving size Wh-^ Mpc (left) and bh'^ Mpc (right). The mean 
decrement along a segment of Lya forest spectrum traversing the center of the cube is 
denoted Diq or D^, and filled (open) squares in the top panel show the mean value of the 
galaxy (dark matter) density contrast in cubes with a specified value of Diq or D5. Plots in 
the lower panels are normalized by subtracting the mean and dividing by the dispersion of 
the corresponding variable. Curves in the upper panels show the cumulative distribution of 
DiQ and D5, with values zero and one marked on the right axis. Upper panels list dispersions 
of the decrement, dark matter contrast, and galaxy contrast (with shot noise subtracted), 
and lower panels list cross-correlation coefficients. All error bars show la uncertainty in the 
mean; the dispersion of 6 values in a given bin of Dio or is much larger. 



-51 - 



2 - 



1 - 



T — I — I — I — I — r 



I ' ' ' I ' ' 

AMOo, CT=0.97 
□ M''•^ a =1.09 

OM'", CT=1.30 y 

OM'■^ (7 = 1.72 o 



o. 



#6^ 



_l I I I L_ 



0.2 0.4 0.6 0.8 1 



T — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — I — r 

AMOo, (7=1.86 o 
- □M0•^ (7=2.23 9 
OW°, CT=2.86 □ 

OMl-5, (7=4.10 A 



Oi 



® 



J I I I I I I I 1 I I I I I I I I I i_ 



0.2 



0.4 0.6 



0.8 1 




-2 2 4 
(D,o-D)/c7 



2 2 4 
(D.o-D)/(7 



Fig. 16. — Similar to Fig. 15, but for dark matter halos. Triangles show the case in which 
each of the 450 most massive halos contributes equally. Squares, pentagons, and circles show 
mass- weighted cases, with the contribution of each halo proportional to M°'^, M, and M^'^, 
respectively. Sohd dots show the results for galaxies, repeated from Fig. 15. 
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Fig. 17. — The conditional mean flux decrement, now defined by an average over all galaxy- 
pixel pairs in bins of comoving rcdshift-space separation. Open symbols show our results for 
the S = 1 arcmin^^ Az^^ sample of the L50nl44 simulation, with spectral smoothing length 
Rs = 10 km s^^ and no rcdshift errors (triangles), rms rcdshift errors of 150 km s^^ (squares) 
and 300km s^^ (pentagons). Solid and dotted curves show Croft et al.'s (2002a) results for 
galaxies with baryonic mass Mb > W^^Mq and > lO^^M©, respectively Filled symbols 
show the ASSP observational results, with approximate error bars estimated by a Monte 
Carlo procedure in which galaxy redshifts are perturbed by ~ 50 Mpc (comoving) to 
produce a "random" sample with the same transverse separations. 



-53- 



1 

0.8 
0.6 



lA 

A 



0.4 - 



0.2 



Q' 



1^. 







-5- 









A - 


R^=0.75 


□ - 




o - 




O - 






ASSP 


• - 



2 4 6 

(h-i Mpc) 



8 



Fig. 18. — Effects of simplified "winds" on tfie mean fiux decrement. Open symbols sliow 
our results for the S = 3 arcmin"^ Az~^ sample of the L22nl28 simulation, with spectral 
smoothing length Rg = 10km s~^ and wind radii of 0.75 (squares), 1.0 (pentagons), 1.5 
Mpc (hexagons) and proportional to M^^^ (stars). Open triangles show our results for 
no winds, and filled symbols show the ASSP observational results. 
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Fig. 19. — Statistical significance of the discrepancy with the innermost ASSP data point, 
which comes from three galaxy-QSO pairs. Vertical lines mark the ASSP measured value 
for Ar < 0.5 Mpc, (D) = 0.11, and the unconditional mean fiux decrement, (D) = 
0.36. For each model, we randomly select 500 3-tuples of close galaxy-spectrum pairs and 
compute the mean decrement for pixels with < 0.5 Mpc. Curves show the cumulative 
distribution of these mean decrement values, for our standard model with no redshift errors 
(solid, corresponding to the triangles in Fig. 17), the standard model with 300 km s~^ rms 
redshift errors (dotted, corresponding to pentagons in Fig. 17), and the i?^ = 1.5 hr^ Mpc 
wind model with no redshift errors (dashed, corresponding to the hexagons in Fig. 18). 



